Abstract

Let $G$ be a graph on $n$ vertices in which every induced subgraph on $s = \log^3 n$ vertices has
an independent set of size at least $t = \log n$. What is the largest $q = q(n)$ so that every such $G$
must contain an independent set of size at least $q$? This is one of several related questions raised
by Erdős and Hajnal. We show that $q(n) = \Theta(\log^2 n/\log\log n)$, investigate the more general
problem obtained by changing the parameters $s$ and $t$, and discuss the connection to a related
Ramsey-type problem.



1 Introduction 

What is the largest $f = f(n)$ so that every graph $G$ on $n$ vertices in which every induced subgraph
on $\log^2 n$ vertices has an independent set of size at least $\log n$ must contain an independent set of
size at least $f$? This is one of several related questions considered by Erdős and Hajnal in the late
80s. The question appears in [3], where Erdős mentions that they thought that $f(n)$ must be at least
$n^{1/2-\epsilon}$, but they could not even prove that it is at least $2\log n$. As a special case of our main results
here we determine the asymptotic behavior of $f(n)$ up to a factor of $\log\log n$, showing that in fact
it is much smaller than one may suspect (and yet much bigger than $\log n$):

$$\Omega\left(\frac{\log^2 n}{\log\log n}\right) \le f(n) \le O(\log^2 n). \qquad (1)$$

Another specific variant of the above question, discussed in [3], is the problem of estimating the
largest $q = q(n)$ so that every graph $G$ on $n$ vertices in which every induced subgraph on $\log^3 n$
vertices has an independent set of size at least $\log n$ must contain an independent set of size at least
$q$. Here, too, one may tend to believe that $q(n)$ is large, and specifically it is mentioned in [3] that
probably $q(n) > \log^3 n$, but the correct asymptotic behavior of $q(n)$ is smaller. In this case, our
results determine the asymptotic behavior of $q(n)$ up to a constant factor, implying that

$$q(n) = \Theta\left(\frac{\log^2 n}{\log\log n}\right). \qquad (2)$$

Both problems above are special instances of the general problem of understanding the asymptotic
behavior of the function $f(n,s,t)$ defined as follows. For $n > s > t$, let $f = f(n,s,t)$ denote the
largest integer $f$ so that every graph $G$ on $n$ vertices in which every induced subgraph on $s$ vertices
has an independent set of size at least $t$ must contain an independent set of size at least $f$. In this
note we investigate the asymptotic behavior of $f$, and obtain rather tight bounds for this behavior
for most interesting values of the parameters. Our results provide much less satisfactory information
about a closely related Ramsey-type problem of Erdős and Hajnal discussed in [3], which is the
following. For which functions $h(n)$ and $g(n)$, where $n > g(n) \ge h(n)^2 \gg 1$, is there a graph on $n$
vertices in which every induced subgraph on $g(n)$ vertices contains a clique of size $h(n)$ as well as
an independent set of size $h(n)$? In particular, Erdős and Hajnal conjectured that there is no such
graph for $g(n) = \log^3 n$ and $h(n) = \log n$; our results here do not settle this conjecture, and only
suffice to show that there is no such graph with $g(n) = c\log^3 n/\log\log n$ and $h(n) = \log n$, for some
absolute positive constant $c$.
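
To make the definition of $f(n,s,t)$ concrete, the following Python sketch evaluates it by exhaustive search for very small $n$. It is only an illustration under our own assumptions: the names `alpha` and `f_bruteforce` are hypothetical and do not come from [3], and the search over all $2^{n(n-1)/2}$ labeled graphs is feasible only for $n$ up to about 6.

```python
from itertools import combinations

def alpha(vertices, edges):
    """Independence number of the subgraph induced on `vertices` (brute force)."""
    vertices = sorted(vertices)
    for size in range(len(vertices), 0, -1):
        for cand in combinations(vertices, size):
            # cand is increasing, so its pairs match the (u, v), u < v, edge format
            if not any(pair in edges for pair in combinations(cand, 2)):
                return size
    return 0

def f_bruteforce(n, s, t):
    """Smallest independence number over all graphs on n vertices in which
    every induced subgraph on s vertices has an independent set of size t."""
    pairs = list(combinations(range(n), 2))
    best = n  # the empty graph always qualifies and has independence number n
    for mask in range(1 << len(pairs)):
        edges = {pairs[i] for i in range(len(pairs)) if (mask >> i) & 1}
        if all(alpha(S, edges) >= t for S in combinations(range(n), s)):
            best = min(best, alpha(range(n), edges))
    return best
```

For $s = 3$, $t = 2$ the hypothesis simply says that the graph is triangle-free, so `f_bruteforce(5, 3, 2)` returns 2, the value attained by the 5-cycle.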

The rest of this note is organized as follows. In Section 2 we state our main results concerning
the behavior of the function $f(n,s,t)$. The proofs are described in Section 3. The final Section 4
contains a few remarks, including the (simple) connection between the study of $f$ and the
Ramsey-type question discussed above.

Throughout the note we omit all floor and ceiling signs, whenever these are not crucial. We 
always assume that the number n of vertices of the graphs considered here is large. All logarithms 
are in the natural base e. 



2 The main results 

The following two theorems provide lower bounds for the independence numbers of graphs in which 
every induced subgraph of size s contains an independent set of size t. 

Theorem 2.1 Let $t < s < n/2$, and let $G$ be a graph of order $n$ such that every induced subgraph
of $G$ on $s$ vertices contains an independent set of size $t$. Denote $k = \lfloor \frac{s}{t-1} \rfloor$. Then $G$ contains
an independent set of size at least $\Omega(k n^{1/k})$ if $k < 2\log n$ and of size at least $\Omega\left(\frac{\log n}{\log(k/\log n)}\right)$ if
$k \ge 2\log n$.

Theorem 2.2 Let $2t \le s < n/2$, and let $G$ be a graph of order $n$ such that every induced subgraph
of $G$ on $s$ vertices contains an independent set of size $t$. Then $G$ contains an independent set of size
at least $\Omega\left(t\,\frac{\log(n/s)}{\log(s/t)}\right)$.

The next result shows that there are graphs with relatively small independence numbers in which
every induced subgraph of size $s$ contains an independent set of size $t$.

Theorem 2.3 For every sufficiently large $t$ and $2t \le s < n/2$ there exists a graph $G$ on $n$ vertices
with independence number

$$\alpha(G) \le O\left(t\left(\frac{n}{t}\right)^{2t/(s-t)}\log(n/t)\right)$$

such that every induced subgraph of $G$ of order $s$ contains an independent set of size $t$.

For certain values of t and s one can improve the previous result as follows. 

Theorem 2.4 Let $t < s < n/2$, where $s < e^{2t}$, and assume, further, that either there exists a
constant $\delta > 0$ such that $(s/t)^{1-\delta} > \log n$, or $s/t = \Omega(\log n)$ and there exists a constant $\gamma > 0$ such
that $\log t > \log^{\gamma} n$. Then there exists a graph $G$ on $n$ vertices with

$$\alpha(G) \le O\left(\frac{t}{\log(s/t)}\log(n/t)\right)$$

such that every induced subgraph of $G$ of order $s$ contains an independent set of size $t$.



3 The proofs 

Proof of Theorem 2.1. Suppose that $G$ contains $t-1$ disjoint cliques whose union $U$ has size at
least $s$. Then the size of the largest independent set in $U$ is bounded by $t-1$, since an independent
set can intersect each of the $t-1$ cliques in at most one vertex. This contradicts the property of
$G$. Therefore if $G'$ is a graph obtained from $G$ by deleting the vertices of $t-1$ disjoint cliques with
maximum union we have that $|V(G')| > n - s > n/2$. We also have that the largest clique in $G'$
has size at most $k = \lfloor \frac{s}{t-1} \rfloor$. Otherwise, by the above discussion, $G$ will contain $t-1$ disjoint cliques
each of size at least $k+1$, whose union has at least $(k+1)(t-1) > s$ vertices. To finish the proof
we apply the classical bound of Erdős and Szekeres [1] (see also [5]) for the usual graph Ramsey
numbers. This result asserts that the maximum possible number of vertices in a graph with neither
a clique of size $k+1$ nor an independent set of size $\ell+1$ is at most $\binom{k+\ell}{k}$. By this estimate, $G'$
contains an independent set of size $\ell$, where $\binom{k+\ell}{k} \ge n/2$. Thus $\left(\frac{e(k+\ell)}{k}\right)^{k} \ge n/2$ and therefore if
$k < 2\log n$ we get $\alpha(G) \ge \ell \ge \Omega(k n^{1/k})$. On the other hand, if $k \ge 2\log n$ we use the estimate
$\left(\frac{e(k+\ell)}{\ell}\right)^{\ell} \ge \binom{k+\ell}{\ell} = \binom{k+\ell}{k} \ge n/2$. This gives that $\ell \ge \Omega\left(\frac{\log n}{\log(k/\log n)}\right)$ and completes the proof of the
theorem. □
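
The last step of this proof can be evaluated numerically. The following Python sketch, under our own naming (the function `es_independent_set_bound` is hypothetical and not part of the paper), computes $k = \lfloor s/(t-1) \rfloor$ and the smallest $\ell$ with $\binom{k+\ell}{k} \ge n/2$, which is the size of the independent set that the Erdős–Szekeres bound guarantees in $G'$. It assumes $t \ge 2$.

```python
from math import comb

def es_independent_set_bound(n, s, t):
    """Return (k, l): k = floor(s / (t - 1)) and the smallest l >= 1 with
    C(k + l, k) >= n / 2, as in the final step of the proof of Theorem 2.1."""
    k = s // (t - 1)
    l = 1
    # compare 2 * C(k + l, k) with n to stay in exact integer arithmetic
    while 2 * comb(k + l, k) < n:
        l += 1
    return k, l
```

For example, `es_independent_set_bound(1000, 10, 6)` gives $k = 2$ and $\ell = 31$, since $\binom{32}{2} = 496 < 500 \le 528 = \binom{33}{2}$.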

Proof of Theorem 2.2. Let $r$ be the largest integer such that

$$\left(\frac{t^2}{4e^2s^2}\right)^{r-1} n \ge s.$$

It is easy to check that $r = \Omega\left(\frac{\log(n/s)}{\log(s/t)}\right)$. To prove the theorem we construct a sequence of pairwise
disjoint independent sets $X_1, \ldots, X_r$ together with a sequence of nested subsets $V_0 = V(G) \supset V_1 \supset
\ldots \supset V_{r-1}$ such that the following holds. Each $X_i$ is a subset of $V_{i-1}$ of size $t/2$, $X_i \cap V_i = \emptyset$, there
are no edges between $X_i$ and $V_i$, and $|V_i| \ge \frac{t^2}{4e^2s^2}|V_{i-1}|$ for all $1 \le i \le r-1$. Then the union of all
sets $X_i$ forms an independent set in $G$ of size $rt/2 = \Omega\left(t\,\frac{\log(n/s)}{\log(s/t)}\right)$.

Assuming the sets $X_j, V_j$ have already been constructed for all $j < i$, construct $X_i$ and $V_i$ as
follows. Let $m$ be the size of $V_{i-1}$. The inductive hypothesis implies that

$$m \ge \left(\frac{t^2}{4e^2s^2}\right)^{i-1} n \ge \left(\frac{t^2}{4e^2s^2}\right)^{r-1} n \ge s.$$

Since every subset of order $s$ in $V_{i-1}$ contains $t$ independent vertices and every independent set of
size $t$ belongs to at most $\binom{m-t}{s-t}$ subsets of size $s$, we have that $V_{i-1}$ contains at least $\binom{m}{s}/\binom{m-t}{s-t}$
independent sets of size $t$. Therefore, using that $m \ge s \ge 2t$ and $\binom{m}{t/2} \le \left(\frac{2em}{t}\right)^{t/2}$, we conclude that there
exists a subset $X_i$ of $V_{i-1}$ of size $t/2$ which is contained in at least

$$\frac{\binom{m}{s}}{\binom{m-t}{s-t}\binom{m}{t/2}} = \frac{1}{\binom{m}{t/2}}\cdot\frac{m!\,(s-t)!}{(m-t)!\,s!} \ge \left(\frac{m}{s}\right)^{t}\left(\frac{t}{2em}\right)^{t/2} = \left(\frac{tm}{2es^2}\right)^{t/2} = \left(\frac{e\,\frac{t^2m}{4e^2s^2}}{t/2}\right)^{t/2} \ge \binom{\frac{t^2m}{4e^2s^2}}{t/2}$$

independent sets of size $t$. This implies that $V_{i-1}$ contains at least $\binom{t^2m/(4e^2s^2)}{t/2}$ subsets of size $t/2$ whose
union with $X_i$ forms an independent set of size $t$. Let $V_i$ be the union of all these subsets. By
definition, for every vertex of $V_i$ there is an independent set which contains it together with $X_i$,
so there are no edges from $X_i$ to $V_i$. Also, since a set of $N$ vertices has only $\binom{N}{t/2}$ subsets of size $t/2$,
the size of $V_i$ must be at least $\frac{t^2}{4e^2s^2}m = \frac{t^2}{4e^2s^2}|V_{i-1}|$. This completes the construction step and the proof of the theorem. □

Proof of Theorem 2.3. We prove the theorem by considering an appropriate random graph. As
usual, let $G_{n,p}$ denote the probability space of all labeled graphs on $n$ vertices, where every edge
appears randomly and independently with probability $p = p(n)$. We say that the random graph
possesses a graph property $\mathcal{P}$ almost surely, or a.s. for brevity, if the probability that $G_{n,p}$ satisfies
$\mathcal{P}$ tends to 1 as $n$ tends to infinity. Clearly, it is enough to show that there is a value of the edge
probability $p$ such that $G_{n,p}$ satisfies the assertion of the theorem with positive probability.

Let $p = \frac{c}{t}\left(\frac{t}{n}\right)^{2t/(s-t)}$, where $c > 0$ is a sufficiently small absolute constant. We claim that almost surely every subset of $G = G_{n,p}$ of size $s$ spans
at most $s^2/(2t) - s/2 = \frac{s(s-t)}{2t}$ edges. Indeed, since $s \ge 2t$, the probability that there is a subset of size $s$ which violates
this assertion is at most

$$\binom{n}{s}\binom{s^2/2}{\frac{s(s-t)}{2t}} p^{\frac{s(s-t)}{2t}} \le \left(\frac{en}{s}\right)^{s}\left(\frac{est}{s-t}\,p\right)^{\frac{s(s-t)}{2t}} \le \left(\frac{en}{s}\right)^{s}(2ec)^{\frac{s(s-t)}{2t}}\left(\frac{t}{n}\right)^{s} = \left(\frac{et}{s}\,(2ec)^{\frac{s-t}{2t}}\right)^{s} = o(1)$$

(where the $o(1)$-term tends to zero as $s$ tends to infinity). This implies that with high probability
every subgraph of $G$ on $s$ vertices has average degree $d \le s/t - 1$. Therefore by Turán's theorem it
contains an independent set of size at least $s/(d+1) \ge t$. On the other hand, it is well known (see, e.g.,
[2]) that almost surely the independence number of $G_{n,p}$ is bounded by

$$\alpha(G_{n,p}) \le O\left(p^{-1}\log np\right) \le O\left(t\left(\frac{n}{t}\right)^{2t/(s-t)}\log(n/t)\right).$$

This implies that a.s. $G_{n,p}$ satisfies the assertion of the theorem and completes the proof. □
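
The Turán step used in this proof, that a graph on $s$ vertices with average degree $d$ contains an independent set of size at least $s/(d+1)$, is constructive. As a minimal Python sketch (the function name and the dictionary-of-neighbour-sets input format are our own assumptions, not the paper's), the standard minimum-degree greedy procedure, which is known to attain this bound, can be written as follows.

```python
def greedy_min_degree_independent_set(adj):
    """adj maps each vertex to the set of its neighbours (undirected, no loops).
    Repeatedly pick a vertex of minimum degree in the remaining graph,
    then delete it together with its neighbours."""
    adj = {v: set(nb) for v, nb in adj.items()}  # work on a private copy
    chosen = []
    while adj:
        v = min(adj, key=lambda u: len(adj[u]))
        chosen.append(v)
        removed = adj[v] | {v}
        for u in removed:
            adj.pop(u)
        for nb in adj.values():
            nb -= removed  # drop edges into the deleted vertices
    return chosen
```

On the 5-cycle, where $s = 5$ and $d = 2$, the bound $s/(d+1) = 5/3$ forces an independent set of size at least 2, which the procedure returns.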
For the proof of Theorem 2.4 we need the following lemma.

Lemma 3.1 Let $G = G_{s,p}$ be a random graph, assume $sp \to \infty$ and fix $\epsilon > 0$. Then the probability
that the independence number of $G$ is at most $\frac{\epsilon}{p}\log(sp)$ is less than $e^{-s(sp)^{1-3\epsilon/2}}$.



Proof (of lemma). Let $k = \frac{\epsilon}{p}\log(sp)$. To prove the lemma we use the standard greedy algorithm
which constructs an independent set by examining the vertices of the graph in some fixed order and
by adding a vertex to the current independent set whenever possible. The behavior of this algorithm
for random graphs can be analyzed rather accurately (see, e.g., [2]). At iteration $i$ of our procedure
we use the greedy algorithm to find a maximal (with respect to inclusion) independent set $I_i$ in
the remaining vertices of $G$. If $|I_i| \ge k$ we stop. Otherwise we delete the vertices of $I_i$ from $G$
and continue. We stop when the number of remaining vertices drops below $s/2$. Note that during
iteration number $i$ we only expose edges incident to $I_i$, therefore the remaining vertices still form
a truly random graph. Given a set $I$, the probability that a fixed vertex of $G$ is adjacent to some
vertex of $I$ is $1 - (1-p)^{|I|}$. Therefore the probability that a fixed $I$ is maximal, where $|I| \le k$, is
at most $\left(1 - (1-p)^{k}\right)^{s/2}$. By definition, when the iteration fails, the random graph must contain
a maximal independent set of size less than $k$ (and hence also a set of size exactly $k$ so that every
remaining vertex is adjacent to at least one of its members). Thus the probability of this event is at
most $\binom{s}{k}\left(1 - (1-p)^{k}\right)^{s/2}$. Moreover, the outcomes of different iterations depend on disjoint sets of
edges and therefore are independent. Finally note that if $G$ has no independent set of size $k$, then
the number of iterations is at least $s/(2k)$. This implies that the probability of such an event is
bounded by

$$\left(\binom{s}{k}\left(1 - (1-p)^{k}\right)^{s/2}\right)^{s/(2k)} \le \left(\frac{es}{k}\right)^{s/2} e^{-\frac{s^2}{4k}(1-p)^{k}} < e^{-s(sp)^{1-3\epsilon/2}}.$$
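
A single pass of the greedy algorithm described in this proof can be sketched in Python as follows. This is an illustration only: the helper names `random_graph` and `greedy_independent_set` are our own, the graph is sampled in full up front rather than exposing edges step by step, and the iterated deletion procedure of the proof is not reproduced.

```python
import random

def random_graph(s, p, rng):
    """Sample G_{s,p}: each of the C(s, 2) possible edges appears independently
    with probability p. Returns a dict mapping vertex -> set of neighbours."""
    adj = {v: set() for v in range(s)}
    for u in range(s):
        for v in range(u + 1, s):
            if rng.random() < p:
                adj[u].add(v)
                adj[v].add(u)
    return adj

def greedy_independent_set(adj, order):
    """Examine the vertices in the given fixed order and add a vertex whenever it
    has no neighbour among those already chosen. The result is an independent
    set that is maximal with respect to inclusion among the vertices in `order`."""
    chosen = []
    for v in order:
        if all(u not in adj[v] for u in chosen):
            chosen.append(v)
    return chosen
```

Calling `greedy_independent_set(random_graph(s, p, random.Random(0)), range(s))` for various $s$ and $p$ gives an empirical feel for how the size of the greedy set compares with $\frac{\epsilon}{p}\log(sp)$.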



Proof of Theorem 2.4. Suppose that $(s/t)^{1-\delta} > \log n$ for some fixed $\delta > 0$ and consider the
random graph $G = G_{n,p}$ with $p = \frac{\delta\log(s/t)}{2t}$. (Note that $p < 1$ as $s < e^{2t}$.) As already mentioned
above, a.s. the independence number of this graph is bounded by $O\left(\frac{1}{p}\log(np)\right) = O\left(\frac{t}{\log(s/t)}\log(n/t)\right)$.
Here we used that $\log(s/t) < s/t < n/t$ and hence $\log\left(\frac{n}{t}\log(s/t)\right) < 2\log(n/t)$. Also, by Lemma 3.1
(with $\epsilon = \delta/2$), the probability that $G$ contains an induced subgraph of order $s$ with no independent
set of size $\frac{\epsilon}{p}\log(sp) \ge t$ is at most

$$\binom{n}{s} e^{-s(sp)^{1-3\delta/4}} \le n^{s} e^{-s\log^{1+\delta/4} n} = o(1).$$

Here we used that $\log(s/t) \to \infty$ and hence $(sp)^{1-3\delta/4} > (s/t)^{1-3\delta/4} > \log^{1+\delta/4} n$. Therefore with
high probability $G$ satisfies the first assertion of the theorem.

To prove the second part of the theorem suppose that $s/t = \Omega(\log n)$ and $\log t > \log^{\gamma} n$ for some
constant $\gamma > 0$. By the previous paragraph we can also assume that $s/t < \log^2 n$. Let $G = G_{n,p}$ with
$p = \frac{\gamma\log(s/t)}{640t}$. Again we have that a.s. $\alpha(G) \le O\left(\frac{t}{\log(s/t)}\log(n/t)\right)$. Since $sp = \Omega(\log n\log\log n)$, the
probability that there is a subset of size $s$ in $G$ which spans at least $2s^2p$ edges is bounded by

$$\binom{n}{s}\binom{s^2/2}{2s^2p} p^{2s^2p} \le n^{s}\left(\frac{e}{4p}\right)^{2s^2p} p^{2s^2p} = n^{s}\left(\frac{e}{4}\right)^{2s^2p} \le e^{s\log n} e^{-\Omega(s\log n\log\log n)} = o(1).$$

Also, since $s/t < sp < t^{o(1)}$, we have that the probability that $G$ contains a subset of order $k = \Theta(sp)$
which spans more than $k^{2-\gamma/2}$ edges is at most

$$\binom{n}{k}\binom{k^2/2}{k^{2-\gamma/2}} p^{k^{2-\gamma/2}} \le n^{k}(kp)^{k^{2-\gamma/2}} \le e^{k\log n} e^{-\Omega\left(k(s/t)^{1-\gamma/2}\log t\right)} \le e^{k\log n} e^{-\Omega\left(k\log^{1+\gamma/2} n\right)} = o(1).$$

Let $H$ be an induced subgraph of $G$ of order $s$. Since the number of edges in $H$ is a.s. at
most $2s^2p$, it contains an induced subgraph $H'$ of order $s/2$ with maximum degree at most $d = 8sp$.
By the above discussion, we also have that the neighborhood of every vertex of $H'$ spans at most
$d^{2-\gamma/2}$ edges and therefore through every vertex in $H'$ there are at most $T \le d^{2-\gamma/2}$ triangles. Now
we can use the known estimate (Lemma 12.16 in [2], see also [1] for a more general result) on the
independence number of a graph containing a small number of triangles. It implies that

$$\alpha(H') \ge 0.1\,\frac{s/2}{d}\left(\log d - \frac{1}{2}\log T\right) \ge \frac{0.1}{16p}\left(\log d - \frac{1}{2}\log d^{2-\gamma/2}\right) = \frac{\gamma}{640p}\log d = \frac{t\log(8sp)}{\log(s/t)} \ge t.$$

This shows that $G$ a.s. satisfies the second assertion of our theorem and completes the proof. □



4 Remarks 

Theorems 2.1 and 2.3 show that if $s/t = O(1)$ and $t > 1$ then $f(n,s,t) = n^{\Theta(1)}$, whereas if $s/t \gg 1$ and
$t = n^{o(1)}$ then $f(n,s,t) = n^{o(1)}$, and if $s/t \ge \Omega(\log n)$ and $t \le (\log n)^{O(1)}$ then $f(n,s,t) \le (\log n)^{O(1)}$.

Theorems 2.2 and 2.4 determine the asymptotic behavior of the function $f(n,s,t)$ up to a constant
factor for a wide range of the parameters. We do not specify here all this range, and only observe
that in particular, for every fixed $\mu > 0$, $f(n, \log^{2+\mu} n, \log n) = \Theta\left(\frac{\log^2 n}{\log\log n}\right)$. For $\mu = 1$ this implies
(2). The estimate (1) follows from Theorems 2.2 and 2.3.
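
As a rough numerical illustration of how the bounds of Theorems 2.2 and 2.3 compare, the Python sketch below evaluates the expressions $t\log(n/s)/\log(s/t)$ and $t\,(n/t)^{2t/(s-t)}\log(n/t)$ with all hidden constants set to 1. That normalization is our assumption, so the outputs indicate growth rates only and are not actual bounds. The function name `bound_shapes` is hypothetical, and the input is $\log n$ rather than $n$ to avoid overflow.

```python
from math import exp, log

def bound_shapes(log_n, s, t):
    """Return (lower, upper): the Theorem 2.2 and Theorem 2.3 expressions with
    hidden constants replaced by 1. log_n is the natural logarithm of n."""
    lower = t * (log_n - log(s)) / log(s / t)
    # (n/t)^(2t/(s-t)) is computed through its logarithm
    upper = t * exp(2 * t / (s - t) * (log_n - log(t))) * (log_n - log(t))
    return lower, upper
```

With $\log n = 100$, $t = \log n$ and $s = \log^2 n$, the first value is of order $\log^2 n/\log\log n$ and the second of order $\log^2 n$, matching the two sides of the estimate for $f(n)$ in the introduction.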

The connection between the Ramsey-type question described in Section 1 and the function $f$ is
the following simple fact.

Claim: If

$$n/2 > s > t \quad \text{and} \quad (t-1)\,f(n/2, s, t) \ge s \qquad (3)$$

then there is no graph on $n$ vertices in which every induced subgraph on $s$ vertices contains a clique
of size $t$ and an independent set of size $t$.

Proof: Assuming there is such a graph $G$, observe that by the definition of $f$ it contains an
independent set $I_1$ of size $f = f(n/2, s, t)$. Omit this set, and observe that the induced graph on the
remaining vertices, assuming there are at least $n/2$ of them, contains another independent set $I_2$ of
size $f$. Repeating this process $t-1$ times (or until the union of the independent sets obtained is of
size at least $n/2 > s$), we get an induced subgraph of $G$ on $\min\{n/2, (t-1)f\} \ge s$ vertices, which
is $(t-1)$-colorable (as it is the union of $t-1$ independent sets), and hence cannot contain a clique
of size $t$. This is a contradiction, proving the assertion of the claim.

In particular, for $t = \log n$ and $s = c\log^3 n/\log\log n$, where $c$ is a sufficiently small positive
absolute constant, it is not difficult to check that the assumption in (3) holds, by Theorem 2.2.

It would be interesting to close the gap between our upper and lower bounds for the function
$f(n,s,t)$. It would also be interesting to know more about the Ramsey-type question of Erdős and
Hajnal described in the introduction, and in particular, to decide if there exists a graph on $n$ vertices
in which every induced subgraph on, say, $\log^{100} n$ vertices contains a clique of size at least $\log n$ and
an independent set of size at least $\log n$.